首页> 外文OA文献 >Study on Quality Improvement of HMM-Based Synthesized Voices Using Asymmetric Bilinear Model
【2h】

Study on Quality Improvement of HMM-Based Synthesized Voices Using Asymmetric Bilinear Model

机译:基于非对称双线性模型的基于HMM的合成语音质量改进研究

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Hidden Markov model (HMM)-based synthesized voices are intelligible but not natural especially under limited-data conditions due to over-smoothed speech spectra. Improving naturalness is a critical problem of HMM-based speech synthesis. One solution is to use voice conversion techniques to convert over-smoothed spectra to natural spectra. Although conventional conversion methods transform speech spectra to natural ones to improve naturalness, they cause unexpected distortions in the intelligibility of synthesized speech. The aim of the study is to improve naturalness without reducing the intelligibility of synthesized speech by employing our novel asymmetric bilinear model (ABM) to separate the intelligibility and naturalness of synthesized speech. In the study, our ABM was implemented on the modulation spectrum domain of Mel-cepstral coefficient (MCC) sequences to enhance the fine structure of spectral parameter trajectory generated from HMMs. Subjective evaluations carried out on English data confirmed that the achieved naturalness of the method using the ABM involving singular value decomposition (SVD) was competitive with other methods under large-data conditions and outperformed other methods under limited-data conditions. Moreover, modified rhyme test (MRT) showed that the intelligibility of synthesized speech was well preserved with our method.
机译:基于隐马尔可夫模型(HMM)的合成语音是可理解的,但不是自然的,尤其是在由于语音频谱过度平滑而在有限数据条件下。改善自然性是基于HMM的语音合成的关键问题。一种解决方案是使用语音转换技术,将过度平滑的频谱转换为自然频谱。尽管常规转换方法将语音频谱转换为自然频谱以提高自然度,但它们会导致合成语音的清晰度出现意料之外的失真。该研究的目的是通过使用我们的新型非对称双线性模型(ABM)来分离合成语音的清晰度和自然度,从而在不降低合成语音的清晰度的情况下提高自然度。在这项研究中,我们的ABM在Mel倒谱系数(MCC)序列的调制频谱域上实施,以增强HMM产生的频谱参数轨迹的精细结构。对英语数据进行的主观评估证实,使用大数据条件下的包含奇异值分解(SVD)的ABM方法所获得的自然性与其他方法相比具有竞争优势,而在有限数据条件下则优于其他方法。此外,改进的韵律测试(MRT)表明,使用我们的方法可以很好地保留合成语音的清晰度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号